Research in Computing Science, Vol. 68, pp. 103-114, 2013.
Abstract: In this paper an approach to primary business-media analysis for further information extraction is proposed. We consider business events representation by looking into part of speech (POS) distribution across tagged n-grams. Two Russian business-media corpora, Russian Business Consulting (RBC) and Commersant, are analyzed, and it is shown that they differ not only in style or themes coverage but also in the range of contexts for the words which mark business-events. Purchase, merger and ownership events are given a closer look at, and it is shown that they are mostly represented by noun phrases in both corpora rather than verbal phrases.
Keywords: Information extraction, business media.
PDF: Business-Media Analysis for Information Extraction
PDF: Business-Media Analysis for Information Extraction